Adaptive Critic Based Adaptation of A Fuzzy Policy Manager for A Logistic System

نویسندگان

  • Stephen Shervais
  • Thaddeus T. Shannon
چکیده

We show that a reinforcement learning method, adaptive critic based approximate dynamic programming, can be used to create fuzzy policy managers for adaptive control of a logistic system. Two different architectures are used for the policy manager, a feed forward neural network, and a fuzzy rule base. For both architectures, policy managers are trained that outperform LP and GA derived fixed policies in stochastic and non-stationary demand environments. In all cases the fuzzy system initialized with expert information outperforms the neural network. Index terms -applications, neural networks, reinforcement learning, genetic algorithms, qualitative reasoning, rule learning

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Adaptive critic based approximate dynamic programming: A new tool for smart manufacturing

This work supported in part by the National Science Foundation under grant ECS-9904378. Abstract Adaptive critic based approximate dynamic programming techniques are gradient based methods for finding optimal policies for multi-stage decision processes. We believe adaptive critic methods are now developed to the point that they can be applied to the full spectrum of decision and control problem...

متن کامل

IS-MRAS With On-Line Adaptation Parameters Based on Type-2 Fuzzy LOGIC for Sensorless Control of IM

This paper suggests novel sensorless speed estimation for an induction motor (IM) based on a stator current model reference adaptive system (IS-MRAS) scheme. The IS-MRAS scheme uses the error between the reference and estimated stator current vectors and the rotor speed. Observing rotor flux and the speed estimating using the conventional MRAS technique is confronted with certain problems relat...

متن کامل

Indirect Adaptive Interval Type-2 Fuzzy PI Sliding Mode Control for a Class of Uncertain Nonlinear Systems

Controller design remains an elusive and challenging problem foruncertain nonlinear dynamics. Interval type-2 fuzzy logic systems (IT2FLS) incomparison with type-1 fuzzy logic systems claim to effectively handle systemuncertainties especially in the presence of disturbances and noises, but lack aformal mechanism to guarantee performance. In contrast, adaptive sliding modecontrol (ASMC) provides...

متن کامل

Design and implementation of an adaptive critic-based neuro-fuzzy controller on an unmanned bicycle

Abstract: Fuzzy critic-based learning forms a reinforcement learning method based on dynamic programming. In this paper, an adaptive critic-based neuro-fuzzy system is presented for an unmanned bicycle. The only information available for the critic agent is the system feedback which is interpreted as the last action performed by the controller in the previous state. The signal produced by the c...

متن کامل

Voting Algorithm Based on Adaptive Neuro Fuzzy Inference System for Fault Tolerant Systems

some applications are critical and must designed Fault Tolerant System. Usually Voting Algorithm is one of the principle elements of a Fault Tolerant System. Two kinds of voting algorithm are used in most applications, they are majority voting algorithm and weighted average algorithm these algorithms have some problems. Majority confronts with the problem of threshold limits and voter of weight...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2001